Optimal Control Algorithm for Subway Train Operation by Proximal Policy Optimization
نویسندگان
چکیده
With the increasing scale of urban subway, total energy consumption subway has increased dramatically and poses a great challenge to comfort passengers punctuality train operation. In order ensure on-time operation passenger comfort, at same time reduce operation, this paper proposes Proximal Policy Optimization (PPO)-based optimization algorithm for optimal control Firstly, reinforcement learning architecture is constructed with position speed as state, objectives, constraint. The proposed model trained by PPO algorithm, reward scaling added training process accelerate improve efficiency algorithm. experimental results show that can effectively while ensuring
منابع مشابه
Multi-objective Optimization Improved GA Algorithm and Fuzzy PID Control of ATO System for Train Operation
In order to solve the problem that automatic train operation control system considering the single factor and control is not easy to be accurate, a multi-objective optimization (MO) based on improved genetic algorithm (GA) and fuzzy PID control method is proposed in this paper. Firstly, based on train operation characteristics, a multi-objective model of train operation process is established. ...
متن کاملHolistic optimization of train traffic by integration of automatic train operation with centralized train management
Nowadays, railways are confronted with numerous pressing problems, including capacity optimization, energy conservation, cost reduction and improving customer satisfaction. While the traditional railway is a very safe means of transportation, it still cannot meet all these requirements. Hence there are high interests in two available systems to overcome these challenges: Traffic Management Syst...
متن کاملProximal Policy Optimization Algorithms
We propose a new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a “surrogate” objective function using stochastic gradient ascent. Whereas standard policy gradient methods perform one gradient update per data sample, we propose a novel objective function that enables multiple epochs of ...
متن کاملTrain Coordinated Optimization Operation with Regenerative Braking
Reducing the traction energy consumption plays an important role in railway energy saving. The traditional train optimization operation theories, taking the coasting as the means of energy saving, can’t fit complicated lines and is suitable for the single train only. Meanwhile, the highly complicated line will cause the risk of train collision. On the background of the “11th Five-Year” State Sc...
متن کاملOptimal Operation of CHP Combined Heat Generation Systems Using the Crow Search Optimization Algorithm
Energy efficiency of power plants is less than 60% However, the efficiency of the CHP units can be up to 90 %.CHP units in addition to high efficiency, They reduce environmental pollutants by 13 to 18 percent. The purpose of this thesis is to use the simultaneous power and power generation plants to reach the optimal economic destination for Genco And to maximize economic profit And to minimize...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13137456